【月末特辑】9月最火AI论文 | 群体RL共享降本;SAPO让旧机也能训大模型
Description
本期的 10 篇论文如下:
[00:29 ] TOP1(🔥640) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing(共享即关爱:基于集体RL经验共享的高效大模型后训练)
[02:49 ] TOP2(🔥341) | 🔒 A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code(A.S.E:一个用于评估AI生成代码安全的仓库级基准)
[04:59 ] TOP3(🔥218) | 🤖 VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model(VLA-Adapter:面向小型视觉-语言-动作模型的有效范式)
[07:07 ] TOP4(🔥212) | 🤖 The Landscape of Agentic Reinforcement Learning for LLMs: A Survey(面向大语言模型的智能体强化学习全景:一项综述)
[09:17 ] TOP5(🔥207) | 🤔 Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth(废话学:用深度解读无意义内容挑战大型语言模型)
[11:19 ] TOP6(🔥183) | 🤔 Why Language Models Hallucinate(语言模型为何产生幻觉)
[13:06 ] TOP7(🔥174) | 🧠 A Survey of Reinforcement Learning for Large Reasoning Models(大型推理模型的强化学习综述)
[15:32 ] TOP8(🔥160) | 🎬 LongLive: Real-time Interactive Long Video Generation(LongLive:实时交互式长视频生成框架)
[18:13 ] TOP9(🔥145) | 💡 Reverse-Engineered Reasoning for Open-Ended Generation(面向开放式生成的逆向工程推理)
[20:27 ] TOP10(🔥140) | 🤖 A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers(科学大型语言模型综述:从数据基础到智能体前沿)
<figure>
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递